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I. INTRODUCTION 


This thesis is an attempt to understand the software 
cost estimating process as viewed by a program manager 
during the early phases of system development. 

Software cost estimators have problemas because of their 
inabili ty to estimate the size of new programs. 
Bistorically-based methods are difficult to use because the 
estiaator is unable to assess skills, tools, complexity, and 
environsent of past developments and of the new software. 
The results are severe cost overruns, schedule slippage, and 
lack of creditility. (Ref. 1] 

In software estimating models, software size is the key 
parameter influencing cost. All the proven estimating teck- 
niques begin with an estimate of the size or the software 
package and then at various levels of sophistication produce 
an estimate of cost or tige based on size and calculated or 
derived productivity factors. 

A quick, reliable estisate of size could provide a 
better cost estiaave for planning purposes. The prcgran 
Banager's concern is time, cost, and perforaance of the 
total systea. He needs to present this information to higher 
DOD management levels during the early planning phase. 

This paper 1S an investigation of ways in which an esti- 
aator gay be able to estimate the size of a new project, 
based on an existing data base of close to 1,000 systeas. 
Breakouts have been made of that data base by applicatioa 
type and the average size and the standard deviation of each 
of these applications is calculated. In the early phase or 
systea developrent, a reasonable goal would be to estimate 
time withın 15 percent and effort within 39 percent of 
actual. 
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The program sanager should have a detailed understanding 
of the process that developed the cost estimate and should 
have sose traceability to the variance or sensitivity of the 
estimating component. The use of automated software models 
for estimating has become popular in industry and ,DOD. 
Program managers can compare different rodels for alterna- 
tive estimates. 

All the variables of the software equation are subject 
to some degree of uncertainty and the program manager must 
have a means of taking this into account in an effort to 
development risk profiles. Putnam (Ref. 2] suggests that 
risk analysis is probably the most important aspect cé any 
software system analysis. In an uncertain process, risk 
analysis measures that uncertainty. This leads to strat- 
egies tc BiniBize risk. The program manajer should perforn 
risk analysis and determine the probability of meeting 
schedule/cost. 

This paper uses SLIM (Software Life-cycle Management). 
The SLIM rodel uses some of the most powerful tools or anal- 
ysis available today -- linear programming, Monte Carlo 
simulation, and algorithms from the PERT methodology -- to 
produce the test possible solution to the software esti- 
matirg proLles. (Ref. 1) 

The program manager predicts and controls schedule and 
costs. It is iaportant Tor the program manager to recognize 
potential problemas early and take effective corrective 
action. 
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II. SQPIWABE LIFE-CYCLE AND DEVELOPMENT PROCESS 


The software life-cycle may be said to start in the 
perception of a need and to terminate when the systen is 
retired as otsolete. The prcgram manager has the responsi- 
bility from start to end for the project. The program 
Banager aust understand the software life-cycle and its 
developsent process. A lack of understanding of the process 
of software development (activities, phases, and wilestones) 
causes a poor estinate. The software development process 
can be subdivided into seguential and cverlappitg phases. 

Figure 2. 1 [Ref. 2] depicts the total system milestones 
in relationship with four software cost models considered. 
The DOD subdivides the life cycle of an automated inforaa- 
tion system into five broad phases: (Ref. 3] Mission 
Analysis Project Initiation, Concept Development, Defirition 
Design, Systes Development, Deployment Cperation. 

Putnam [EFef. 4] divides it as: Systens definition, 
Functional design specification, Development(design and 
coding, test and validation), Operation and maintenance. 
The developaent process is divided as follows. 

POR - Preliginary Design Review. This is the earliest 
time that a fcraal review of the functional design specifi- 
Cations can be expected to be satisfactory enough tc 
continue into the next phase of developaent. Functional 
desigr and (high level) system engineering is essentially 
coaplete. 

CDR = Critical Design Review. This is a review of the 
detailed logic design for each element of system. The 
design consists ct tlow charts, HIPO! diagrams, Fseudo-code 


THI FQ ا ا و ا‎ pub) was developer at 
IBM as design representation schemes for top-dowr software 
developsent ard contained a visual table of contents, a set 
ot over viev overview diagrams, anc a set of detail diagrams. 
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figure 2.1 Software Life-Cycle. 


logic, or equivalent. It is held when design and coding are 
separated by manageaent decision as by MIL STD (military 
standards). Coding cannot start until after a successful 
CDR under tbis philosophy. There must be sufficient design 
to start coding. 

FCC - Pirst Code Complete. In a top-iown, structured 
design and coding environment, PCC is the time at which all 
the units of code have been written, the units have been 
peer and management checked, success fuliy coapiled and run 
as units, and thought to be satisfactory end product code. 
It is then entered into a library of completed code. (Note: 
coding will ccntinue thereafter as rework of these modules.) 
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SIT - Systems Integration Test. This is the earliest 
time that all elements and sutsystems have been put together 
and the system can work together as a complete integrated 
package and can demonstrate that in a formal system test. 

UOSI- User Oriented System Test. Pclloving correction of 
deficiencies resulting from SIT, UOST is the first time that 
a test of the system in a full user environment -- target 
machine and operating system, real data, real operating 
conditions -- can be conducted. 

IOC - Initial Operational Capability or start of instai- 
latior, depending on the environment. This is a careful, 
tentative first use under rigid contrel. Ofter this is a 
first site installation ina live environment with antici- 
pated later zulti-site deployment, or the start of operation 
in parallel  witn the predecessor system in a single site 
replaceaent environment. 

Foc = Full Operational Capability. Here the systen 
zeets specified quality standards sufficiently wel trat 
organizations will use it in everyda; routine mission oypera- 
tions. (Te SM), that is a 95 percent reliability levei; 
calibration and technology factors are normalized to this 
reliability level.) 

Boebsa (Ref. 5] divides the cycle into feasibility, plans 
and requirements, product design,  programaing, integratior 
and test, and maintenance. Tke primary emphasis cf his 
COCOMO? aodel is on the development portion of the life- 
cycle which CCCOXO defines as starting " at the beginrirg of 
the product design phase and ending at the end of the soft- 
ware integration and test phase ". (Ref. 5] 





?eCOnstructive Cost MOdel, a hierarchy or three iacre2as- 
ingly detailed models ( basic, intermediate, detail) «Lich 
range from a ውክ ከ Ro scaling model as a 
function of product size to a aicrc-estinationa model with a 
three-level work breakdown structure and a set of phase- 
sensitive multipliers for each cost driver attribute. 
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The PRICE-S software model is one of the family of RCA 
cost predicting rodels. PRICE-S divides the software devel- 
opment cycle into three phases: design, implementation, and 
test and integration. | 

Tbe question facing the program manager 35 what to 
invest in and what is its payoff. Boehm {Ref. 5] developed 
a value-of-inforsation guideline with the following five 
condi ticos: 

1. There exist attractive alternatives whose payoff varies 
greatly, depending on some critical state of nature. 

2. The critical states of nature have an appreciable 
probability of occuring. 

3. The investigations have a higb probability oft 
accurately idettifying the occurrence of critical 
States of nature. 

4. The required cost and schedule of the investigations 
do not overly curtail their net value. 

5. There exist significant side benefits derived fror 
performing the investigations. ۱ 

The major difficulty with these guidelines is deter- 
Bining what are the critical states of nature. Boeha implies 
some of these states in his definition of feasibility fhase 
and plans and requirements phase (Bef. 5): 

Fea sibility phase: How auch shoula ve invest in informa- 
tion systea (user questionnaires and interview , current- 
systems analysis, workload characterizataons, simulations, 
scenarics, prototypes) in order that we cocverge on an 
appropriate definition ard concept cf operation for the 
system we plan tc iaplegent? 

Plans and requirements phase: How rigorously should we 
specify requirerents? How much should we invest in require- 
ments validation activities( automated completeness, corsis- 
tency, and traceariiity checks, analytic -odels, 


simulations, prototypes) before proceeding to design ard 
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develop a software systen? It is important for the progran 
manager to know the early phases of systema development. 

Boeha [ Ref. 6] points out the uncertainty of cost esti- 
nation during the early project phase. Fig 2.2 shows the 
accuracy within which software cost estimation can be made, 
asa function of tke software life-cycle phase ,or of the 
level of knowledge we have of what the software is intended 
to do. At the beginning of the feasibility phase, the rela- 
tive range of scftware cost estimates is roughly a factor of 
four’ on either the high or low side which is not surprisiry 
based on the lisited knowledge available at this point. The 
estimation uncertairty is reduced to from .5 to 2 after the 
concept of operation has been defined. after completion of a 
requirement specification, the estimation range is .67 to 
1.5. Gradually, the estimation range becomes smaller until 
we finally arrive at acceptance. 

Fairley's { Ref. 7] suggested life-cycle approach divides 
into four models: The phased model, cost model, prototype 
life-cycle ncdel, and successive versions model. Costs 
incurred witkin each phase include the cost of performing 
the process and preparing the products for that pnase, plus 
the cost of verifying that the products of the present phase 
are coBplete and consistent  vitn respect tc all previous 
phases. Fairley also suggests there are some reasons for 
developing a prototype: (1) to illustrate input data 
formats, message, reports , and interactive dialogues for 
the customer; (2) to explore technical issues in the 
proposed product; and (3) in situations where the phased 
zodel or analysis -> design -> implenentation is not appro- 
priate. Product developsent by the method of successive 
versions is an extension of prototyping in wnich an initial 
product skeletcn ıs refined into increasing levels of 


3The ranges are intended to represent 40 percent conti- 
dence liaits, that is , " within a factor of four cr either 
side, 89 percent of the time." 
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figure 2.2 Software Cost Estimation Accuracy vs. Phase. 


capability. 


Putnam [Ref. 3) suggests that a good life cycle nodel 


should possess these characteristics: 
1. Consider all activities and phases. 


2. 


3. 


4. 
5. 
6. 
?. 


8. 
9. 


Relate aanageBent parameters to aanageaent 
responsitilities. 

Be adaptive to actuzl project data and requireaents 
Changes( i.e. aust be tiae-varying or dynaaic) 
Provide engireering accuracy. 

Provide sensitivity profiles. 

8e phenogsenologically based. 

Relate produce to resource consuartion (both statically 
and dynamically) the technology being applied. 

Be capable or future growth. 

Be able to adequately treat known and future systea 
tyres and developzent environaents. 
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Software life cycle and development processes offer a 
structured means for planning, developirg, and controlling 
the software project. Boehs (Ref. 9] suggests that the 
tradeoffs between lower developaent costs and lower life 
cycle costs are characterized by the modern prograuminy 
practice and required reliability cost estimating relation- 
sbips fcr software developsent and maintenance. The progran 
manager must understand the software life-cycle and develop- 
ment process cf multiple models and compare with DOD life- 
cycle. The program aanager must predict and control schedule 
and costs. Alsc tne programs  3anager must recognize poten- 





tial problems early and take effective corrective action. It 
is iaportant for the prograg manager to miniaize the cost/ 
schedule impact of requirements changes. 
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III. ESTIMATING TECHNIQUES AND COST ATTRIBUTE FACTORS 


In order for the program manager 583 ከ35 support team 
to evaluate a software cost proposal, they must have a 
detailed understanding of the process that developed the 
Cost estimate and should have some traceability to the vari- 
antes or sensitivity of the estimating components. 





The object of software cost estimation is to determine 
what resources (manpower, computer time, and elapsed time) 
will be neeaed to produce the software associated with the 
project. Stanley [Rer. 10] listed some reasons for not 
oktaining a gcod estiaate as: 

1. A lack of understanding of the prccess of software 
de velopaent. 

2. A lack of unierstand1ng of the effects of various 
tecanical and management constraints. 

J. A view of that each project is unique, which 
inhibits project to project corparisons. 

4. ል lack cf historical data against whica the uodel 
can be checked and for calibration. 


Boeha (Ref. 5] suggests that a good software  aodel 
Should possess the following characteristics: 

1. Definiticn: Can you tell what it is estimating? 
will different people give similar factor rating? 

2. Fidelity: Ace the actuals close tc tne estimates? 

3. Detail: Does it give (accurate) phase and activity 
breaklowns? 

u. Constructiveness; Can you tell why it gives the 
estimates it does? Does it help ycu understand the 
software job? 

5. Objectıvity: Is it harà to jigger the model to gc 
ary result you want? 

6. Stability: Do small input changes produce small output 
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ch anges? 

7. Scope: Does the model cover your class of software 
project? 

8. Easy to use: Are the model inptts, and outputs easy to 
understand aad specify? 

9. Prospectiveness: Does it depend on information not 
known until later? 

10. Parsigory: Does it use redundant or unnecessary 
factors? 

11. Availability: Can I get access to the model? 

The prograa sanager coBpares various cost models fo: 
alternative estimates. The model should allow for the use 
of historic data in calibcaticn for a particular orjganiza- 
tion and type of sortware. A jood software cost estimation 
model should cover software engineering issues which arise 
througkcut the software life cycle. 


A. TECHNIQUES 


According tc Stanley (Ref. 10], most cost models use one 
or both of the following equations. The first is cailed the 
cost equatior: 


E E nad 

where E = effort needed, 5 = size of program is teras o: 
some measure of lines of code and a and b are chosen by 
curve fitting on a historical database. Different values of 
a and b are appropriate to different organizations, project 
types, units of measurement cf E and S, and iteas inclulel 
in the the estiasates. 

The second equation is called the general sSuaminy 
equation: 
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where tbe a. are input parameters derived from the descrij 
tion of software characteristics and the characteristics 
the development environment, and the value of f; are chosen 
by curve fitting on a suitable historic database. Table 1 
(Ref. 7] shows a comparison of effort and development tine 
eguations accordiag to software size. Some models provide 
eguations to estimate total Ban months of effort, MM, in 
teras of the number of thousands of delivered source 
instructions, KDSI. Also the development tire for software 
project, 70537, measured in terms of MM. A software project's 
cost can be obtained by multiplying tne effort in man months 
by the cost per Ban month. 





TABLE 1 
COMPARISON OP EPPORT AND TIME EQUATIONS [Ref. 7] 





| 
| Effort eguation Development tine Author | 
| MM = TDEV = | 
| EL TLE: 20.91 2.847 (MM) **0.35 dalston | 
4.9 (KDSI) #*0.98 3. 08 (MM) **0.36 Nelson 
1. 5 (KXDSI) **1.22 4.38 (484) **#0.25 Frebur ger 
er KDSI in 2-30 MM) **0.38 3oehn 
.O (KDSI) **1.1 2.50 (MM) **0.35 Boehn 
| 3.6 (KDSI) **1.20 2.50(M4) **0. 32 Boehn 
| 1. 0 (KXDS I) **1.40 - Jones 
0.7 (KDSI) **1.50 = Halstead 
| 28 (KDSI) **1.83 - Schniedec 





| 


Thibodeau (Rer. 11] states that parametric models may te 
divided into three classes: 
1. Regressicn model: The parazeters to be estimated are 
mathematically related tc a set of input parameters. 
The parameters of the hypothesized relationships are 
arrived at Ly statistical analysis aad curve fitting 
or an appropriate historical datatase. 
2. Heuristic acdels: Here ctservation and interpretation 
or historic data are combined with supposition and 
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experience. 

3. Phenomenological model: This is based on a hypothesis 
that the software development process can be explaired 
in teras of some more widely applicable process or 
idea. 


The first model class seems tc be the approach used in Bost 
ccst modeling. This class includes the Aerospace, Doty, 
Farr, Zagorski, and Telecote 200615. The second model class 
seers to be the type used in preparing a proposal where 
detailed WBS «leaents are prepared and suamed for the total 
cost. The Boeing and PRICES models along with the DOD 
MICRO Procedure and the Wolverton model aave been termed 
heuristic. An exaaple of the last class is the Putnaz 
model. (Ref. 11] 
Stanley (Ref. 10] suggests a general pattern followed dy 

all the models: 

1. Estirate software size 

2. Convert size estimate to labor estinate 

3. Adjust estimate for special project characteristics 

4. Divide the total estimate into the different project 

ph ases 
5. Estimate non-technical lakor costs 
6. Sum the cost 


4ost models start from an estimate of project size. Soze 
models convert from size to labor, others go lirectiy fron 
size to money estimates. The effective estimate is an 
adjustment of the basic estimate intended to take account or 
any special project characteristics which make it dissiailar 
to the pattern absorbed in the underlying historic database. 
Each aodel which deals with a project's schedule makes 
assumptions about the allocation ot effort in different 
project phases. 
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B. SOME SPECIPIC TECHNIQUES 


Proa the CS4* data base, ve perfora curve fitting to get 
the relationship of softvare size versus developasent tine 
and effort using the regression aethod. Despite the vide 
ranges, the developasent time anl effort correlate very vell 
with the nuater of source statements. Figure 3.1 shows 
size versus developaent tine froma the curve-fitting aethod. 


Mixed Application Data Base 





Pigure 3.1 (SB Database Software Size vs. Developsent Tine. 


To estisate scftware size, the PERT method is suggested. 
The PER; equations estimate the expected size (ES) and stan- 
dard deviatica ( 6 ) of each subsystea as 


ES a 8 ቀ 458 ቀ ከ ። 6 = bdb >- e 
i ፈል” ٠ 1 e 6, =, 0% 


where 





“Quantitative Software Managenaent, Inc. 
SPrograrn Evaluation and Review Technique. 
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a = the lowest possible size of tne software subsysten 
i 

7/۸ = the most lixely size of the ناک‎ 3 ٣۶۵ 

b = the highest possible size of the subsysten 
1 


The estimated total software size (S) and standard deviation 
(OS) are then 


0 2 1/2 
B. pe^ f ES. 5 ሪ5 (bs, d 


This PERT sizing technique requires more work to break up 
the software into subsysteas and to estimate most likely 
size for each subsystem as well as to eliminate upper and 
lower limits. PERT estimates should be aade ty knowledge- 
able software designers, preferably those who will do tke 
specific work. 


C. COST ATTRIBUTE FACTORS 


Stanley [Ref. 10] defines two major area or software 
cost drivers: project specific factors and organization 
dependent factors. Boehm ( Ref. 9] identifies five facto. 3 
which closely ratch Stanley's. Size attributes, program 
attributes and computer attributes fall into Stanley's 
project specific factor. Personnel attributes and project 
attributes fall into Stanley's organization-derendent 
factors. dolverton [ Ref. 12] suggests that top-level char- 
acteristics are parameters which can ke classified into 
software structural paraaeters and project financial parane- 
ters. Sortware structural paraaeters nay be divided size, 
progran attritutes, hardware attributes, project attributes, 
environmental attriLutes. Project financial parameters are 
divided into direct labor charges, overhead, other direct 
charge, general and administrative expense, and fee. 
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Bruce and Pederson [Bef. 13] divide cost drivers into 
four categories: requirement factors, product factors, 
process factcrs, and resource factors. 


1. Reguiresent Factor 
Two factors that can have a major impact on the 


ultisate cost of a software product are (1) the guality of 
the specifications and (2) the stability of the requirezent. 


a. Quality of Specifications 


A good definition of requirements is the corner- 
stone of a well-defiaed, well-understood, and well-costed 
sortware develcpment. Inconplete requirement definition is a 
aajor cause of cost overruns. 


b. Stability of Requirements 


The responsibility of the project manager is to 
understand the software requirements and to point out that 
changes in the requirement baseline are changes. The 
project manager should then defire the cost and/or schedule 
impact so that the change can be given a [air evaluation. 


2. Prodyst Factors 


a. Software Size 


à favorite measure for software system size is 
lines of operational code or deliverable code. Parametric 
cost estimating models relate cost in some way to the size 
estinate. 


b. Difficulty 


After considering the relative ir:iciculty of 
various software systems and functions, the cost estimator 
must consider the difficulty of the specific application and 
its heritage tc coaplete the analysis of the difficulty 
factor. 
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ae Management Structure 


Management structure incorporates the effects oz 
various company policies with respect to allocating costs 
for certain non-project nanagement personnel asa direct 
charge to projects. 


b. Management Ccatrol 


This covers the cost of project support in such 
areas as mnanagenert ¡information  prccessing, scheduling 
support, administrative support, and clerical support. 


c. Development Methods 


This factor attenpts to quantify the imvact of 
various development methods. The development metnods ofr 
interest include such approaches as top-down design ani 
testing, stroctured programming, use of chief programmer 
teams, and use of the structured valk-throughs. 


d.  Tcols 


The program manager must consider how the soft- 
ware vill be developed, tested, and maintained and what 
tcols will be aeeded to accomplish these tasks. For systens 
developed for large-scale comfuter, a host of coaplier, data 
base managers, editors, display interface package, flow 
Chart package, plot package, utility routines, and test data 
generation tocls are generally available. The costs associ- 
ated with tools are a function of the tool complexity, use, 
features, and, maturity. 


e. Available Software 


A significant reduction in project can be 
achieved through the use of existing software. The costs of 
modifying the existing software can then be determined 
sub jectively. 
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£. Data Base 


The size, complexity, and special file access 
requiresents for the data base are extremely iaportant 
parameters in deriving an accurate software development 
estisate. 


4. Resource Factors 


Developsent costs for a given software package may 
vary substantially, depending on such factors as experience 
of available people, quality of project staff, and avail- 
ability of develcpsent cosputer resources. 


a.  Numter of People 


The major contributor to the reduction in 
productivity associated with projects that require larger 
staffs is the increase in time seeded for communication 
between people. 


b. Experience of People 


The program manager must consider the general 
level of experience and skill required for various project 
assignment and incorporate appropriate labor rates into the 
estimate. 


G. Personnel Performance 


Since software developsent ıs an analytical, 
scaetines creative activity requiring abstract reasoning, 
individual productivity variations are to be expected. 
Productivity assessment is extrenely important because cost 
estimation generally is reduced to deriving a productivity 
figure per unit of effort per person for a given skill 
category. 
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4. Availability of Coaputing Resources 


For batch developeent systeas there is a iinear 
relationship between turnaround tiee and testiag costs. for 
interactive development systeas, significant Jevelopaent 
efficiencies are often reali red. 


e. Suitability of Coeputiag Resources 


The asyaptotic effect on developaent costs as 
the hardvare speed and Beaory sire constraiats are 
approached has deen desonstrated in batch, real-tiee, 
&icborae, Bilitary, and coaaercrcial systea. 


f. Elapsed Tite 


The amount of calender tine available to develop 
a software proauct is iatieately related to the uitihBate 
development cest. Below the threshold, the increased staff 
required to accomplish the job ia a shorter period Nas the 
opposite of the desired effort. 
Table 2 (ef. 6) shows the various site, ۳۴۵4۰ 
coaputerc,  percsonnei, aed project attribites used ty each 
aodel tc determine software cests. 


Numerous factors influence the cost of a software 
developaert project aad each should be evaluated during cost 
estiaation to ensure that proper weaghting is applied to the 
cost estiaates. Curreat eodels differ 10 the factors that 
are required as stecific inputs. Nany ditfereat factors aay 
ve sudseped if a single paraneter ih sore BOlels, particu- 
lariy the todeií subjective paraneters, The prograe aanager 
should assess existing perzoanel/facılıty resources and 
detereinme future requirements. Also the program aanager 
&easures overtead and deteraine the efficiency of resource 
allocations it aultıpie developanent task projects. 
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TABLE 2 


COST DRIVERS USED IB VARIOUS COST 5 [Bef. 6| 
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It is iapcrtart for the program manager to develop reli- 
able estimates by  perfcraing parallel calculations on 
sultiple aodels. 





IV. SLIS 


The SLIM model is a cosfrehensive software cost esti- 
sating’ package produced by Quantitative Software Management 
Inc. SLIM is an aid for effectively managing software 
development. Using engineering techniques, SLIM provides 
Cost, time, personnel, and machine estimates for developing 
computer software systems. SLIM identifies limiting 
constraints that affect development plans. Conridence 
levels and risk factors are calculated to provide the 
Banager with the data needed to make decisions on cost, 
schedule, effort, quality, manloading, and cash flow. The 
SLIM , software costing and nanagement system does these 
things (zef. 2): 

1. Deteraines the size of the systea in Source statenents 
2. Estimates the people, cost and schedule for the prcject 
3. Projects cashfiow over tke life cycle 
4. Identifies limiting constraints on manpower apa 
schedule 
S. Obtains risk projections for cost and schedule 
6. Updates estimates from the real data once development 
is underway 
7. Dynamically adjusts for requirements changes to give 
new cost and schedule 
The SLIM model is claised to Le valid for any systen 
where at least two of the following four criteria apply: 
1. 5000 lines cf code or greater 
2. $100,000 or zore in development costó 
3. Peak aaufower cf 3 people Or more 
4. Developsent tiae of 6 months or longer 
Over sixty large organizations such as DOD, 13M, and GIz 
have used the SLIM package. 511155 accuracy has been teste} 
ror over 1390 systens of all types in the Jnited States ani 
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6. Rayleigh management paraaeters can be easily turned 
into a linear prograa with important aanagesent 
constrairt(manpower, cost, schedule) properly 
considered to yield constrained optimal solutions. 

Figure 4.1 [ Ref. 2) shows the Rayieigh function as a manaye- 
ment tool. 


MANPOWER 


SLCPE = MANPOWER SUILDUP RATE (K/t ክን 


ay = INCREMENTAL MANPOWER 


ax » INCREMENTAL TIME 
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Y MANPOWER AT ANY TIME 
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Pigure 4.1 Bayleigh Function as Software Managenment Tcol. 


The Bayleigh eguation is linearized by taking loga- 
cithas, 


Ln(y/t) = La (K / ,.፦. ቀ (-1 / 2 ፖለ ) E 


Jpor plotting tais equation in terms cf manpower applied to 
a systes over tine squared, a straight line is provided ir 
which Ln( k م‎ ty? ) is the intercept aad (-1/ 2 ta? ) is 
the siofe. Putnas [Bef. 87] performed this operation for ore 
hundred systess ard fouad that the argument of the intercept 
( "0 / ty?) has a sost interesting property. Putnan 
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(Ref. 1) suggests that the ratio ሺ / ىمع‎ represented the 
difficulty, D, in teras of the programming effort and the 
time to produce it. By taking the gradient of D, he gets 


| 9 8| . K / * = Constant 


This difficulty gradient reflects the organizational capa- 
bility in doing that type of work for which the constant was 
derived. The difficulty in the developaent of the software 
varies inversely with the value of the gradient, i.e, the 
aost difficult projects have the smallest gradient. Froan 
observation, Putnaa (Ref. 2] suggests ¡VD | is quantized at 
the following levels; 7.3 foc new developments with inter- 
faces to other prograss; 14.7 for a new stand-alone develop- 
sent project; 26.9 for a re-building of an existing prograr; 
55.0 fora coaposite systeant; and 89.0 for a couposite 
5756682 containing auch existing code. These values's 
uncertainty is about 15 percent of the base value. When the 
difficulty is plotted against productivity rate, PR*, for 4 
nuBbec of systeas, Putnam gets the relationship 


- 2/3 
PR ። 5 ሀ ' 


C = productivity constant 
8 


The area undar the coding rate, F curve? is the total 
quantity of final end product source statements that vill be 
produced by tise t (Ref. 5). That is, source statesents are 
produced during the design and coding subcycle of Rayleign 
curve. Thus, the inteyral of the manpower rate for thia 
subcycle multiplied by the productivity gives source state- 
sents. Frogs the above reasocs, rutnaa (Ref. 1) develops a 





total end product code / total effort to producy cole 
"Productivity rate « Banpovec ٭‎ PR a Y 
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Bathematical relationship, the SLIM software equation, which 
is an powerful tool for planning and evaluating the develop- 
ment effort lite cycle. The software equation is: 


Sears, . عه‎ ٠ [WF ባ ፡ “ዘ »። ቫ.። .گم‎ (7) ٠ dt 


S = number of delivered source instructions 
5 


2 = State of technology constant 


E. 5 is equivalent to Bayleigh equation parameters 


Putnaz (Ref. 15] observed that the time at which the 
Rayleigo curve reaches its maxiaua value, tq, corresponds 
to the tiae cf system testing and product release for many 
software products. That is, normally reak manpower, Y aax, 
exists as a function of developsent tise (ty), 

ርድ EIER ےم‎ 

The area under the Rayleigh curve interval represents the 
total effort expended in that interval. Approximately forty 
percent of the area under the Rayleigh curve is to tbe left 
of ty and sixty percent is to the right. By rearranging the 
software equation and applying a factor or .4 to reflect 
that the developsent effort, E, is approximately the first 
forty percent of the life cycle curve, 


4 9 و9 
t à‏ / )= ا = E (BU, NY) = K‏ 
k‏ 
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By applying the burdened labor rate (average dollar 
cost per san year or Ban month of effort), the equation 
changes to cost 


i 8 ጓ3 u 
Development coat = ( 8 / MY) .4 | 7 / ~ 


B. UNCORTAINTY ANALYSIS 


Almost every factor entering into an estimate of effort 
and schedule is subject to acae degree of uncertainty. The 
Banager needs to portray the effects of the uncertainty 
associated with each of these factors. To get a reasonable 
estinate of uncertainty in life-cycle effort and development 
time, Monte Cario siaulation is used. Given Cy Ss 63, , 
IYDI, and ۷۱ Putnam ( Bef. 2] obtained statistica on 
variablity of the e£fort and develop time from several thou- 
sand saaples. Froa the SLIM software equation and the diffie 
culty gradient eguation, we baka the equation as follows? 


(e) 


t . ትፕ፦ 
d 
K "ት 3 t 3 
Ivo | 4 


Also the standard deviation ( Gta » OR ¦) obtained tron a 
Monte Carlo siaulation vill be used to aake a risk analysis 
profile. An APL proyrae for this siaulation analysis or 
estisating developgsent tine, effort, and the ua jor sileatone 
tiae is shown io Figure 4.2. 
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APL Prograa for Dewelopsent Tise and Effort. 


Pigure 4.2 


Linear prograaaing is used to obtain the two aost iapor- 
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resource allocation and control (Ref. 2). The equatiors for 
the LP forsulation are as follows: 


$ w C K t software equation 
5 k a 
"IET LT saximun peak manpower 
d = sax P P 
K/t > Jw 7 sinisun peak Banpower 
d = sin P ۲ 
2 
K / b < | 0 | gaximum difficulty 
3 | 
ሺ / ~ €« ١9 2 ١! maxisua difficulty gradient 


t < cortract delivery time 
= 


($ / NY) ( .46 K) < total budgeted amount for development 


These can ce solved with graphic or siaplex methods. 
Figure 4.3 shows LP graphical feasibility solution using 
LOG-LOG transformation plot. (Ref. 2) 


C. AVAILABLE OPTIONS 


The SLI4 function routine is cosposed of three options: 
top-level, what-ir option, and isplementation option. 
Pigure 4.4 shcus the SLIM sethodology. [Ref. 4] 
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Pigure 4.3 LP Graphical Solution (ref. 2. 
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Figure 4.4 SLIM Diagran (Ref. 4. 
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1. Tapsleyved Qption 


The top-level functions in SLIM include: Calibrate 
iaput, project iaput, and project estiasate. The process by 
which a odel ia tailored to a particular clase of syateasa 
with the intent of making it a better predictor is called 
calibration. It is important for the progras sanager to try 
to tailcr his aode) froa his past data. 


a. Calitrate Input 


Allows user to calibrate historic projects for 
productivity and gsanpower buildup indexes. 


b. Project Input 


Proapts user for all project estiaate inforaa- 
tion and duilde or sodifies project date files. 


C. Project Estinate 


Provides all cost, schedule, aanpower, quality 
and riak estimates for a eoftvare project. Once a aloiaus 
tiae solution has been established the nanagenent what-if 
options can be used to ispose project constraints. then an 
acceptacle aclution ie chosen the Laplesentation options cau 
generate 1 consistent set cf work plana. 


2. Fùal=4 Q9۵ 


It da iaportant for the prograa aanager to try 
sensitivity analysis. Unlese it ia essential that the soft- 
were be built in the ainisue tiae, the «xtimator should 
consider alternative solutions that take longer but coat 
less. The what-if functions provide the avility to explore 
additional build strategies that take longer and could 
better suit the estairgator’s ۰ص0‎ The @QanayeBent what-is 
options inci ude i! 
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a. Design to Schedule 


SIIM will automatically determine the 0-32 
time schedule for which development of the system is 
feasible. This function say be used to set an alternative 
schedule for development. A new corresponding cost, effort 
and peak manpcwer will be provided. 


b. Design to Cost 


This function is used to allow the user to set a 
new cost for development. A new time schedule, level ot 
effort and peak manpower will be generated. 


Cc. Design to Effort 


This function is used to allow the user to set a 
new level or effort for development. A new time schedule, 
cost and peak manpower will he generated. 


d. Design to Risk 


This furction uses the trade-off law *ogetter 
with a user specified level of risk of not exceeding a 
required delivery date to generate an expected develorpaent 
time, level of effort, cost and peak manpover. 


e. Design to Reliability 


This function permits the user to input a speci- 
fied Mean Tise To Pailure(MTTF). It then deteraines tie 
appropriate development time, effort, cost and peak mar power 
to meet that MTTP together with tke estimated number of 
errors and error rates. 


f. Linear Prograa 


This function uses the technique of linear 
prograasing tc determire the minimum effcrt (ard cost) or 
the minimum time in vhich a system can be built. The results 
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are based on the actual manpower, cost, and schedule 
constraints cf the user, combined with the systen 
constraints descrited earlier to yield a constrained optinal 
sclution. 


g. Best Bid 


This function picks the best time-effort cosbi- 
nation ty maximizing the joint probability that the develop- 
sent tige is greater or equal toa user- speciried end cate 
and the cost is less or egual to a user- specified cost. 


ከ. Temporary Change 


This function lets the user teroorarily chance 
up to nine paraaeters- title, Start date, size, standard 
deviation on size, labor rate, uncertairty oa labor rate, 
inflaticn rate, productivity index and aanpower ouiliup 
index. 


3. Japlesention Opticon 


Once the estimator has found a solution that best 
suits the estimator's requiresent, the estimator wili need a 
set of detailed plans to implement the solution. By perioóci- 
cally ccmparinq progress of the development witn tae plan, 
the estiaator has tne Beans to control all phases of the 
software development from the peginning of the feasibility 
study through completion of operation and maintenance. The 
impleaentation options include: 


a.  Manloading 


This function prcvides prcjections of the near 
nuzber of pecple (and standard deviation) tnat wili be 
applied to the project throughout development. These 
projections are based op an optimal application of resources 
over tise. 
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b. Cashflow 


This function provides projections of the 
expected cashflow on ይ month-to-month basis througaout 
development of the system. 


©. Life-Cycle 


This function provides projections of the 
expected manpcwer and cashflow throughout the life-cycle of 
the syster. Projections are provided on a monthly, quar- 
terly, cr yearly basis. 


4. Risk Analysis 


This function determines the probability of 
developing a system within a specified time or for a speci- 
fied cost. it is very useful for strategic planning purposes 
by providing the associated with various tise and cost 
decisiors. 


e. Werk Breakdown 


This function shows a breakout of effort devoted 
to specific skill categories as a function of the develor- 
sent schedule. 


I. Effort Between Milestones 


This function shows a breakout of effort 
devoted to principal activities as a function of the devei- 
opment schedule. 


q. Milestones 


Based on a predetermined total development tize, 
this function provides a realistic schedule for the aajor 
silestones of the project. 
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h. Code 


This function provides a rate of code produc- 
tion and cusulative code productioa fur valid end product 
cost. Top down structured programsing is assumed. 


i. Gantt Chart 


This function provides a Gantt Chart of activity 
phases and their respective milestones. It will reflect 
whether the design and coding is done in a top-down struc- 
tured method or with a formal critical design review 
£ollowed by a coding phase. 


j. Front-End Estiaates 


This furction provides low, average, ard hign 
estimates or the time and effort required for the feasi- 
bility study aná design phase. 


k. CPU Usage 


This function provides a table of expected 
aachine usage over the life cycle of the systes, along wita 
an estimate c£ total machine hours required for developaent. 


l. Reliability 


This furction gives projections of error Tate, 
cuaulative errors created, found and fixed, and Mean rine To 
Failure month by sonth until a reliability of .9399 is 
achieved. 


5. Decusentation 


Based on tne total system size and data froa 
Lundceds of similar software systeas, a range of expected 
pages of documentation is given. 
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n. Summary of Development Costs 


This function prcvides a summary of the aajor 
developsent ccsts for the systea. These costs include labor 
costs aad CPU costs over tise and the total documentation 
ccst. 


o. Benefit Analysis 


This furction coaputes the benefit of the syste- 
required to amortize the cost of developaent and  zairte- 
parce. It is based on the anticipated economic life of tne 
systes as well as the average rate of return for the 
organization. 


The SLIN model furnishes an effective aeans to estiazate 
ard control the cost, schedule and manpover requirements of 
building and maintaining medium size to very large scftware 
systers. The program manager can tailor his estiaates to 
meet all realistic constraints iaposed on the developmert. 
After an intensive evaluation of the SLIN approach, the 
Departaent of Defense adopted the methodology as the stan- 
dari macro estimating techni que to be used on major software 
systeas in ıll services and defense agencies. 
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V. RROCEDORE AND DATABASE 


Most comsacn sethods of software costing start with esti- 
sation of the nusber of instructions. Boeha (Ref. 6] said 
the biggest difficulty in using today's algorithas in soft- 
ware cost models is the problem of providing sound sizing 
estimates. 

Putnam [ Ref. 1) suggests that in the systeas definition 
phase, we dc estimate a rough idea of the systen size 
(Source statements) and establish bounds on tanpower effort 
and developaent years. In tne requiresent ard specifications 
phase, the ruster or files, reports, ana application 
progrags are good estisators of the size of the system ani 
hence the time and effort required. 

This paper is an investigation of ways in which we aight 
be able to estimate the size of a new project, pased on 
Putnan's existing data base of close to one tnousarl 
systeas. Breakouts have been made of that data base by 
application type and the average size and the standard devi- 
ation of each of those application types has been calcu- 
lated. The estisator can te approacted in a Bayesian sense, 
such that if it is a scientific system, then it would fall 
sozewhere in the range for a scientific system. This would 
be the Bayesian prior. Then this category is presertei 
graphically, and the person asked whether it tends to be 
toward the low end of this category, toward the high erd of 
this category, cr low-aiddle or high-maiddle. By doing sone 
statistical analysis, it is possible to coge up with a 
veighted expected value and anew estizate or a standard 
deviation just ty using the person's notional choice and the 
هلع م‎ 2-51 218 formula. This would be a Bayesian likelihood 
estimate; it could be combined together vith the prior usiaj 
Bayes' theorem. The usefulness of the Bayesian approack to 





statistical inference rests upon the usefulness of incorpo- 
rating personal, subjective beliefs directly into the 
analysis. 

Proa Bayesian inference and decision, it nay be seen 
that the posterior protabilities are derived directly froz 
prior probabilities and the likelihoods. It should again be 
recalled that the mean and standard deviation of a norzally 
distributed random variable completely specify its prob- 
ability function. Further, it is easily provided by sears 
of the calculus that if the prior protability and the lixe- 
libood are both normally distributed, the posterior prot- 
abilities aust be normally distributed. The reciprocal of 
the posterior variance ( (^) is equal to the sum of tne 
reciprocal of the prior variance ( g? ) and the reciproca. 
of the likelihood variance ( g? ), reflecting the fact that 
the two sources of inforgmgaticn are pooled together. Tae 
posterior mean (E, ) is a weighted average of the pricr cean 
(Eo ) and the saaple near (E, ), the weights  teing tz: 
reciprocais of the respective variance (Pers. 16,17]. 212 


equations are as follows: 


E = -------- È e ------ 


E 
2 2 0 > 1 
% 6 


These postericr zeans and standard deviations will te usei 
as SLIM iaput parameters. 
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ል. 0585 DATA PASE 


QSM (Quantitative Software Managenent), Inc. is a company 
formed to help estiaators solve critical cost and schedule 
ccntrol problemas in business and governaent. The QSM data 
base is composed of one thousand systems from DOD, RADC (Rone 
Air Development Center), US Aray Cosputer Systezs Coanani, 
and various companies. Appendix G shows a saaple for data 
foraat. The (SM data base is composed of ten types as shown 
in Table 3. 


TABLE 3 
(SB DATA BASE BY APPLICATION TYPE 


rr 
(D 


e 


| 
| 
| 
| 
| 
| 


275 
Sicro ccdey Fira ware data pase 8 
keal-t13e data base 


p 


Avionic Jata “ነቦ 

systea Software data base 
CoósBa2nd 5 Control data dase 
Telecoa d B pam 
scientific ata bas 

Process control “dete Dase 
3usiness data سو‎ 
0 Unknowo applicatio 

Mixed Application data base 


| Application Type NO. Systen 


DD INNEN ወቃ 


— 


AVJ 

512 
7435 
6 870 
0186 
5035 
1452 
i 
1583 
1533 
000 
9549 


-— 4 ریا نل ے‎ 0 "di yn) ao 
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ኋ 
1 
5 
8 
7 
0 
4 
9 
2 
7 
بي‎ 
۴ 


ي لكا 





Table 4 shows that 2SN data base divided by size Caûje. 
B. PROCEDURE 


This idea ıs that by partitioning tne data base 
according to statistical sub-groups, we estimate new broject 
size using Bayesian inference. It pay be possible to come 
close enough to tbe true value to give meaningful software 
estiaates ın early feasibility study phases Of a Bajor 
project. The procedure consists of five steps. 
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TABLE 4 | 

ር5፳8 DATA BASE BY SIZE BANGE | 

Category range 22 سپ‎ ፥ 1n DB me | 
spcea nge ize 

ı Small K - 15K PUR - 20-8 279 " 8296 

$ ZU K- j K 10K - K 33 8343] 

590108 Large K -79K k- 1 K 0 3474 

à La K- 200K K- 2 K 79 120573 

Ver? Large 200K-6 OR 0K-600K 62 .318366 

Extra Large 600K-1000K 30300 6 :763060 

Super Large More than 1000K 750k-1500k 2 1260000| 


1. Detersine Application Type 


The different syster types kave unique properties 
which have to be considered so that parameters can be tuned 
to these properties. It is important to know what kind of 
software application type will be sade. Using each applica- 
tion type's mean and standard deviation, we get statistical 
tesults froz running the SLIM aodel. Appendix A shows the 


test case. In the test case, it is assumed that tize is 
estimated within 2 percent and effort within 9 percent. 
Appendix F shows the extreme case. In the extreme case, 


tire is estimated within 30 percent and ef.ort within 80 
percent. 


2. Select Size Category and its Qyantile 


Project size is a sajor factor that determines the 
level of manaqgesent control ard the types of tools anà teci- 
tiques required on a software project. Initially the ع٤‎ 17 
manager deteraicres the size category froa Table 4 and, basec 
on the possitle spread range, he considers the overiap on 
toth sides of the the category range. From application type 
and size range, the program maLager can choose the proper 
sean and standard deviation cf software size from Appeniix 
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C. It will be used for Bayesian prior estinates. Por 
exanple, let us select the category size of large. For this 
selection range of 75K-200K, the overlap range is graphi- 


cally represented in Figure 5.1. 


Ip---T-------- rz 


Pigure 5.1 Where it Palls in the Range (Large). 






Cum 
pros 


Now select a size range 2208 Low, Middle Low, Middle 
Higa, High. Assuming that the size is approrinately 
normally distributed, ve can use these as Bayesian likeli- 
hcod estiaates. These size ranges are also shown in Figure 
9.1. The program sanager believes it falls in the second 
guantile which we call “sid Low". Using weighted means and 
the PERT-sizing formula, the project manager can get the 
sean and standard deviation of software size. 


S = (50 + 4(125) #250) /6 = 133.3 (K) 
fs = ( 250 - 50 ( /6 = 33. (K) 


The other size categories and their respective quantiles are 
represented in Appendix D. 
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3. Coapute Nev Mean and Standard Leviation 


To get posterior sean and standard deviation as 
inpnt parameters for SLIM, ve use the Bayesian approach. He 
ccabine prior estimate froz application's type and size an] 
likelihood estimate from the progran sanager's subjective 
belief. 


u. Bua SLIS 


The project estimates provide all cost, schedule, 
Banpover, quality, and risk estimates for a software 
project. This output is used to determine the nminiaus tine 
developsent strategy and its associated uncertainty. Once a 
ainiaua .tiae solution is established the program 332332 
Should use the sanagement what-if options to impose project 
© 5523 When an acceptable solution is obtained, use 
the inplesentation reports and graphs to generate a set of 
work plans. 

5. Analysis 

To analyze the sensitivity of our solution to varia- 
tions in the size, we use the temporary change routine. The 
report for terporary changes will contain a sSumegeary of the 
changes anda new minimum time solution based upon the 
changes. Pesults of the ainiamua tise solution are output ia 
three tables. Tae first table, the siniaum time solution, 
gives a consistert set of sanagasent aetrics (3ean and stan- 
dard deviation) for building the systes in the stortest 
possible tise. The second table, 3 seasitivity prorile, 
shows hcv the sanagement metrics change as a result of OL» 
and three standard deviation changes in tne systea size 
(mean). Toe third table, a consistency cneck, cospares the 
estimator's solution with historical data for systeas of 
comparatle size io the QSA data base. 





All the variables of the software equation are 
subject to some degree of uncertainty and the  prograa 
Banager must have a means of taking this into account in an 
effort development risk profile. Putnam (Ref. 1) suggests 
risk analysis is probably the most important aspect of any 
software system analysis. In an uncertain process, risk 
analysis aeasures the uncertainty to let us develop proper 
strategies to minisaize the risk. 


C.  EIABPLE 


Let us examine a scientific type example in whick a new 
stand-alone development project will be wade. Initially the 
prograa manager believes that the project size category vill 
be large. 3y observing the graphical representation of the 
range, the program aanager subjectively estinates that tise 
project falls into "Mid Low" range. For a scientific 
"Larje" project, we can derive prior estinates for the mear 
and standard deviation of 117389 and 34508 cespectively froa 
Appendix ©. Also we calculate likelihood estimates of sein 
and standard deviation of 13330) and 33000 respectively fron 
Appendix D, where it falls in the "Mid Low" range. Fron 


Bayesian equations, we can get posterior 6513281954, as 
follows. 
1 1 8 1 1 
6; 2 345082 330002 238502 
38502 238502 
5 z u E 117389 + enana 133309 = 125700 
2 345082 330002 
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These estinates vill be used as SLIN input parameters. We 
assume that this project uses average manpower and produc- 
tivity index. Prom running the SLIM model, we get the 
Binimua time solution. Knowing the minimum development tine 
and standard deviation give the program manager a framework 
from which to plan and control. the software development 
process. From Table 5, for the minimum time solution, the 
Program aanag«r should consider the estiaate having a 50 
percent chance of peing realized upon completion of the 
systea. When this expected value is added to its standard 
deviation, it gives a new value that has an 84 percent 
chance of being realized. Alsc the percent of expected value 
of the standard deviation isa majcr factor. For this 
example it is as follows: 


er / t, s 2.1 23.4 = .989 
d d ۴ 


Gz / f = 139 / 516.5 = .269 


This shows that development tiae 18 estimated within 9 
percent ang effort within 27 percent in one standard devia- 
tion. The cther range results are shown in Appendix >. 
Appendix F shows it for business applications. 

Table 6 shows the corresponding expected values for 
time, effort and cost by increasing cne and three standard 
deviaticn of system size. Also it shows that a 99 percent 
confidence interval of develcpment time is between 16.2 


Bacnth and 28.2 month. 

Table 7 shows that if the values shown are within plus 
or ainus 45 percent of the mean for systens of coaparable 
size in the data base, tce table labels thea as "in normal 
range". Otherwise the values will be reported as either 
greater than cr less than the norzal range. 








TABLE 5 
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TITLE! Thesis Example DATE: 20-SEP-1985 | 

| TIME 11127146 | 

EXPECTED VALUE | 

MANAGEMENT METRIC (50% PROBABILITY) STD DEV | 
SYSTEM SIZE (STATEMENTS) 125700 23850 
MINIMUM DEVELOPMENT TIME (MONTHS) 23.4 2.1 

DEVELOPMENT EFFORT (MANMONTHS) 516.5 139.0 

DEVELOPMENT COST’ (x 1000 8) i 

(UNINFLATED) 2178 668 | 

(INFLATED) 3% 747 | 

PEAK MANPOWER (PEOPLE) 34 - | 





لم 


TABLE 6 
SENSITIVITY PROPILE 


UNINFLATED 
SOURCE STMTS . MONTHS MANMONTHS COST (X 19000) 
-5 STD DEV 54150, 16.2 162. 575. 
-1 STD DEV 101850. mo 후 396. 1607. 
EXPECTED 123700. 23.4 216. 2178. 
+i STD DEV 149350. 5.1 s 653. 26 
+; STD DEV 197230. -8.2 903. 37682. 


= = - ㅡ - eee — 


 ጩ 5፡-== .5:ፎ5ጭ.‹ርርፎ: ‹‹.ጠወ ባ፡፡ سوا ےے‎ gee SEE ce ET — — —— — 


Figure 5.2 shows the major ailestone time for tke 
current solution as a statistical expected walue, i.2. , the 
value that statistically has a 50 percent probability of not 
being exceeded upon completion of the aain program. 

Table 8 stows the probability ranges for effort, cost, 
and time íroa 1 percent to 399 percent --a Frooad enough band 
to cover all realistic possibilities. 
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TABLE 7 
COMSISTENCY TABLE 


MANAGEMENT METRIC VALUE 


E ASSESSMENT 
bed — 23.4 IN NORMAL RANGE 

Y (MANMONTHS) 316 IN NORMAL RANGE 
AVERAGE STAFF ING (PEOPLE) == 


መ= IN NORMAL RA 
PRODUCTIVITY (LINES/MM) 243 NGE 


IN NORMAL RANGE 


CE lt ATT — —— 








e < a MD a RD aa: aa E aaa E mma aD «aD‏ جھے y OQ‏ ہیے۔ 


figure 5.2 Risk Profile (time). 


Pros the tables and graph, the program manager deter- 
mines the probability that other times, effort, and costs 
will not be exceeded. Develogaent time and ailestones are 
the aost sensitive elements in the process. Milestones scale 
linearly, aeaning that if the project aanager is late on an 
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TABLE 8 | 

















PROBABILITY PROFILE | 
UN INFLATED | 
PROBABILITY MANMONTHS COST (X 5 1009) TIME (MONTHS) 
1 * 193 ens. جو پری‎ 
5 * 208 1080 20.0 
10 X 3-28 1322 20.8 
20 x 400 1616 21.7 
30 X 344 1028 2.2 
40 ጄ 401 2009 2a. 9? 
SO % $16 2178 =. 
ሴን ጄ = 2247 23. 
79 x 389 2929 24.5 
80 x ecl 2740 23.2 
$0 X 693 3033 \ 26.1 
93 % 
99 % 


242 2276 26.8 
040 2771 28.2 


early ailestone, the project manager will very likely be 





late on succeeding ailestones. It is ¡important for the 
program manager to remember Brooks’ law: Adding manpower to 
a late project makes it later. 
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VI. CONCLUSION AND RECOBSEBDATIONS 


In order for the programs nanager and his support tean 
to evaluate a software cost proposal, they must have a 
detailed understanding of the process that developed the 
cost estimate apnd should have some traceanpility to the vari- 
ances or sensitivity of the estimating cosponents. It is 
important for the  prograa manager to recognize the 
sinilarities/differences between the defense systen life 
cycle and the other consercial development life cycle. Tae 
program manager should compare various cost acdels for 










alternative estinates. 

The life-cycle curve can te sathesatically representej 
by the Bayleiga equation, which is a good aanagement tcol. 
Por planning and evaluating the developaent effort iif2 
cycle, the SLIM model is a very powerful tool. 

It is hard to estimate scitware size in tne early phase 
of systema developmert. but the program ganager can estisate 
the new project size using Bayesian iafereace. To jet a 
Bayesian estiaate of software size, the program narajer 
combines a prior estiaate from application's type ari 
category size with a likelihcod estimate fro» the progran 
manager's subjective belief. Waen combined with the capa- 
bilities of the SLIM model, the developsent tize is esti- 
mated within 15 percent and effort within 3) percent in ore 
standard deviation. 

This methed couid be a fruitful way to get at the early 
estimating prcpblea in a gross vay, yet good enough to get ir 
the "ball park" of what people need at that phase of the 
project. 

Future research should focus on the relationship between 
cost attribute factors and systea application types. 
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ARPREBDIZ A 


DISTRIBUTION BY APPLICATION TYPE(BEST CASE) 


Application Type 


Micro code/PFirm vare 
Real-time 

Avionic 

system Software 
Conmand 5 Control 
Telecoa 

Scientific 

Process Control 

Busi sess 

“Mixed Application 


Developaent Time 


Mean. 
10.00 
16.59 
19.20 
18.69 
17.20 
14.61 
20.31 
16.06 
18.29 
18.05 
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Std. Dev 
.21 
- 38 
2 44 
- 41 
. 38 
0 
. 5 
37 
. 0 
- 38 





Effort 
Mean. Std. Dev 
16.0 1.3 
174.0 15.5 
282.9 25.4 
257.0 22.3 
192.7 16.7 
108.1 8.3 
334.3 25.9 
156.1 14.3 
243.8 20.5 
233,5 1222 








ARPEBDIX ፲፻ 
DISTRIBUTION BY APPLICATION TYPE (EXTREME CASE) 


vo C -J O NY E WwW Ww = 


Application Type Development Tize Effort 
Mean. Std. Dev Mean. Std.Dev 
Micro codeysFira ware 10.43 3. 68 24.6 20.2 
Real-tine 21.37 7.10 498.4 423.8 
Avionic 22.04 2. M 533.7 457,8 
systen Software 21.68 6. 85 515.1 411.9 
Command E Control 19.65 6. 70 400.6 351.0 
Telecon 16.33 5.49 202.5) 172.8 
Scientific 25.80 10. 12 975.8 912.0 
Process Control 18.17 5. 29 276.9 206.4 
Susi ness 21.73 7.15 529.9 427.1 
4ixed Application 20.98 7. 36 477.6 417.7 
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ARRENDII $ 
QS8 SAMPLE DATA 





Run Date: 06-25-".985 PROJECT DETAIL REPORT Page: 
Run Time: 11:00:59 Project: GROSS MARGIN ANLYS 
Date: 05/21/85 Project name : GROSS MARGIN ANLYS 
Organization: System number: 964 


SIZING INFORMATION 
Total Source Statements: 23703 





Name of Language Type % of Total Size 
COBOL High Level 22 + 
NATURAL Non-Procedural 71 $ 
JCL High Level 7 i 
00 E 


% of Total Source Statements 
Brand New (ss): 100 % 





Modified (ss): V B 
Unmodified (ss): % 9 
TIME-EFTORT-STAFTING 
Feasibility Study: mos .5 mm | 
Functional Design: 4 mos 3.5 mm 1.5 Pk Mpwr Overlap (mos) 
Main Software Build: 3 mos 10 mm 7 Pk Mpwr Cost 
Operations + Maint.: 3 mos 2.5 mm 
FOC Date: 01/85 
OVERRUN/SLIPPAGE (MAIN SOFTWARE BUILD) 
Time Slippage: mos Cost Overrun: (mm) 





QUALITY (MAIN SOFTWARE BUILD) 


Errors (SIT-FOC): MTTF (1st mo oper): days 
Errors (ist mo after FOC): 13 


PROJECT CONSTRAINTS (MAIN SOFTWARE BUILD) 





Cost: People: 7 
Time: 8 mos Computer: Severe 

ENVIRONMENT 
Average Personnel Skill Experience 
Overall: Mediun Similar Systems: Mediun Languages: Medium 
Computer: Medium Methodologies: Software Aids: High 
Mgmt Team: High Staff Turnover: Tools + Utils: Good 
Response Tine: 5 Minutes - 1 Hour 


Development Computer: IBM 3033 
Memory Occupancy: ۹ 
Real Time Code: — ኒ 
Requirements Change: 2% 
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Run Date: 06-25-1985 PROJECT DETAIL REPORT Page: 2 


Run Time: 11:01:01 Project: GROSS MARGIN ANLYS ۰ 

APPLICATION | 
Application Type: Business 
System Features: 


Data Base 


























SYSTEM DESIGN COMPLEXITY 
Algorithms mostly known but logic designed from scratch. Interfaces were 
straight forvard. 


CALCULATED FIELDS 


Productivity Index: 21 Manpower Buildup Index: 6 

Total Developed, Delivered, Executable Source Lines of Code: 237023 
USER FIELDS 

1) 2) 

3) USA 4) CON 


PROJECT DESCRIPTION | 
ONLINE REPORT REQUESTING WITH 3 SCREENS OF REPORT SELECTION CRITERIA AVAILABLE 





REPORTS SHOW GROSS MARGIN INFORMATION FOR SEGMENTS OF BUSINESS NIGHT BATCH RUN 


FACTORS THAT HAD A POSITIVE OR NEGATIVE IMPACT ON THIS PROJECT 
-LACK OF CLEAR INFORMATION ON DATA WHICH IS USED TO GENERATE REPORTS LED TO 
INCORRECT ASSUMPTIONS ERRORS WERE FOUND DURING TESTING WHICH CAUSED THIS PHASE 


TO TAKE LONGER THAN PLANNED. 
-ONE BATCH PROGRAM NEEDED ABNORMALLY LARGE TABLE REDESIGN NECESSARY TO MEET 
PERFORMANCE CRITERIA 
-TIGHT SCHEDULE +USER PROJECT MONITOR DID A THROUGH TESTING JOB 
+ TEAM WORKED AT NIGHT TO GET ACCEPTABLE TURNAROUND ON THE CPU THIS GOT THE 
JOB DONE BUT HAD A ADVERSE EFFECT ON THE TEAM PHYSICALLY 
TOOLS, UTILITIES, COMPUTER BASED AIDS & METHODOLOGIES USED ON THIS PROJECT 
TOOLS : 
SAS TEST & DEBUG TOOL 
NATURAL 
ADABASE 
MAYFLOWER SDM SYSTEM DEVELOPMENT METHODOLOGY 


M n t ua MN A © به‎ ses oa ቁ m a > © a- ቤዜ * ቁፋ © r:o i> too -" v» © © -— "^ وي‎ a “ a '" 
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